FLOATING POINT UNIT FOR MULTIPLE DATA ARCHITECTURES 



BACKGROUND OF THE INVENTION 

The present invention relates to computer systems and architecture, and more 
particularly, to a computer system having a floating point unit ("FPU") for supporting 
multiple floating point architectures. 

Computer software is typically designed to utilize one of several floating point 
architectures, such as either IEEE Binary Floating Point Standard 754-1985 ("IEEE- 
754"), or IBM®-S/390® hexadecimal floating point standard. See "Enterprise Systems 
Architecture/390 Principles of Operation" (Order No. SA22-7201-02, IBM®, 1994); and 
"IEEE standard for binary floating-point arithmetic, ANSI/IEEE Std 754-1985." 

Software applications that are designed for one floating point architecture may not 
be compatible with another floating point architecture. However, it is often desirable to 
run the same application on different hardware platforms supporting different floating 
point architectures, or to use data generated by an application on one platform with a 
different application on another platform that supports a different floating point 
architecture. Moreover, it is often desirable to run applications that are based on different 
floating point architectures on a single machine. These needs are particularly relevant to 
multi-tasking environments where task switching between different applications may 
require using different floating point architectures. 

The representation of a floating point number is different between IBM®-S/390® 
hexadecimal based architecture and IEEE-754 compliant binary architecture. In an 
IBM®-S/390® hexadecimal architecture format, a floating point number, Nha, is 
represented by 

Nha = (-l) s * l6 (Ex P° nent - HexBias) x O.Fraction (1) 

The hexadecimal Exponent field is 7 bits in length for all formats and the Fraction field is 
24 bits in length for Short format, 56 bits in length for Long format or 1 12 bits in length 
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for Extended format. In an IEEE binary architecture format a floating point number, N B a, 
is represented by 

N BA = (-l) s x 2 (Exponent - Bina ^ Bias) x 1 .Fraction (2) 

The binary Exponent field is 8, 1 1 or 15 bits in length and the Fraction field is 24, 53 or 
1 13 bits in length for the Single, Double or Double Extended formats, respectively. The 
values of the exponent biases for the two architectures are also different. In particular, 
the value of the hexadecimal architecture Exponent Bias is 64 for Short, Long and 
Extended formats. The value of the binary architecture Exponent Bias is 127 for the 
Single format, 1023 for the Double format and 16,383 for the Double Extended format. 

It is desirable to create an FPU for a computer system that supports multiple 
floating point architectures. Multiple floating point architecture support should be 
provided without resorting to duplication of floating point hardware and without 
sacrificing performance. In addition, such an architecture should be transparent to the 
greatest extent possible in order to simplify the interaction between the user and the 
system. 

Accordingly, there is a need for a single FPU that supports both a hexadecimal 
based IBM®-S/390® hexadecimal architecture and an IEEE-754 compliant binary 
architecture. It is also desirable to create a single floating point unit that supports both 
architectures efficiently, without sacrificing performance. It would be inefficient in terms 
of hardware usage to implement two floating point units, one for each format. Some 
previous attempts converted all binary operands into a hexadecimal-like format and 
operated internally on just one format. However, performance was sacrificed in these 
implementations because two additional cycles were required to format binary input into 
hexadecimal, and then reformat a hexadecimal result into binary. Other previous 
attempts optimized the data-flow for binary, and converted only the hexadecimal 
operands into the binary format. However, this penalized hexadecimal operands with 
format conversion cycles. 
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SUMMARY OF THE INVENTION 

An embodiment of the invention is directed to a computer system with a floating 
point unit ("FPU") for supporting multiple floating point architectures. Multiple floating 
point architectures are supported by an FPU with an internal data-flow format that 
accommodates formats of each architecture. The system includes a format converter for 
converting between the internal data flow format and the architected external data types 
by multiplexing the exponent. 

The system includes a floating point unit having an internal data-flow according 
to an internal floating point format for performing floating point operations. The internal 
format has a number of exponent bits sufficient to support each of the plurality of floating 
point architectures and the internal format has a number of fraction bits sufficient to 
support each of the plurality of floating point architectures. 

The system also includes format converters for converting the exponent value of 
each floating point architecture into the internal floating point format so that a data 
operand of any of the floating point architectures input to the floating point unit is 
converted into the internal floating point format for subsequent arithmetic operations, and 
the result of the operation is converted back into the original floating point architecture 
by converting the exponent value of the result from the internal floating point format into 
the original floating point architecture. 

BRIEF DESCRIPTION OF THE DRAWING 

FIG. 1 provides an overview of the data-flow through a floating point unit of a 
computer system in accordance with an embodiment of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

FIG. 1 provides an overview of the data-flow through the floating point unit of a 
computer system embodying the present invention. This exemplary embodiment 
supports both IBM®-S/390® and IEEE-754 architectures. The system includes a floating 
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point execution unit ("FPU") 10, a localized input format conversion multiplexor 
("IFCM") 12 for receiving the input data in external format, and input format conversion 
control ("IFCC") 14 for receiving external format information corresponding to 
architecture type and data format length. The IFCC 14 controls the IFCM 12 to cause the 
IFCM to output the floating point data to an input data register ("EDR") 16 in an 
intermediate masked format. The IDR 16 receives the intermediate data for transfer to 
the FPU 10. The FPU 10 internally processes the floating point data through a single 
data-flow path, as indicated by the single arrows within the FPU 10. The processed 
floating point data is received from the FPU 10 by an output format conversion 
multiplexor ("OFCM") 18. Output format conversion control ("OFCC") 20 uses the 
external format information of architecture type and data format length to control the 
OFCM 18 to output the data to a result register ("RR") 22 in a format corresponding to 
the format of the original external data. Thus, output data in the original external format 
becomes available from the RR 22. 

The FPU 10 uses a single execution data-flow path to execute both IEEE-754 and 
IBM®-S/390® based floating point instructions. Thus, the data-flow path supports two 
different floating point architectures. A shared internal binary architecture is used with 
two different biases to simplify the conversion process for both of the supported external 
architectures. The format conversion process to and from the internal binary based 
formats does not adversely impact performance. 

The internal formats corresponding to the supported external architected formats 
are listed below. The internal floating point data format corresponding to the 
hexadecimal external format is 

Nhi = (-l) s x 2 (Exponent - 32 ' 768) x O.Fraction (3) 

and the internal floating point data format corresponding to the binary external format is 

Nbi = (-l) s x 2 (Exponent ' 32 < 767) x 1 .Fraction (4) 
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where the Exponent and Fraction fields of the internal format are 16 and 56 bits in length, 
respectively. The format conversion that takes place from the external format to the 
internal format requires no additional delay and minimal hardware. This format 
conversion takes place as the data is received into the IDR 16 of the FPU 10. The 
multiplexing that is performed to obtain the fraction at the IFCM 12 can be utilized to 
efficiently implement this format conversion without incurring additional clock cycles. 

The exponent conversion from external format to internal format accounts for the 
difference in the biases, as well as for the difference in the exponent bases. The required 
conversion can be derived for Binary Architected Format to the Binary Internal Format as 
follows. Xba represents the value of X in a binary architected format and X b i the value of 
X in a binary internal format. Then, 



(5) 



Thus, 




(6) 



Equating like parts of Equation (6) yields: 



Xba(sign) = X b i(sign) 



(7) 



Xba(fraction) = X b i(fraction) 



(8) 



Xba(exp) - Architected Binary Bias = X b i(exp) - 32,767 



(9) 



and 



Xw(exp) = Xba(exp) - Architected Binary Bias + 32,767 



(10) 
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The Architected Binary Exponent Bias is in the form of 2 (n l) - 1, where n is the 
length of the external exponent field. In particular, n = 8, 1 1 or 15 for short, long or 
extended data formats, respectively. The internal Binary Exponent Bias is in the form of 
2 (m l) - 1 where m is the length of the internal exponent field. In this embodiment, m = 16 
bits and the corresponding internal binary bias is 2 (I6_1) - 1=32,767. The Binary Exponent 
conversion that is required can be derived as follows. 

Xbi(exp) = Xba(exp) - Architected Binary Bias + {2 (m " 0 -1 } (11) 

X bi (exp) = X ba (exp) + {2< m -»> - 1} - {2<*' l) - 1} (12) 

and 

X b i(exp) = X ba (exp) + 2 (m ' !) - 2 (n (13) 

Similarly, the architected Hexadecimal Exponent Bias is in the form of 2 (n * °, 
where n = 7 bits. The internal Hexadecimal Exponent Bias is in the form of 2 (m " °, where 
m = 16. However, the Hexadecimal Architected Exponent is an exponent for base 16 and 
the Base of the internal FPU (reference numeral 10 in FIG. 1) data-flow is base 2. Thus, 
the external hexadecimal exponent and bias must be multiplied by 4 since l6 (exponent) = 2 (4 * 
exponent) Hexadecimal Exponent conversion that is required can be derived as follows. 
Xha represents the value of X in a hexadecimal architected format and X h i the value of X 
in a hexadecimal internal format. Then, 

X ha = X hi (14) 

Thus, 

(.^XhaCsign) x 16 (Xha(exp)-64) x O.X ha (fraCt) = 

J )Xhi(sign) x 2 (Xhi(exp) - 32,768) x ().X h j(fraCt) ( 1 5) 
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(.^Xha(sign) x 2 (4"Xha(exp)-256) x O.X ha (fraCt) = 

(.^XhKsign) x 2 (Xhi(exp)- 32,768) x O.X hi (fraCt) 



(16) 



Equating like parts of Equation (16) yields: 



Xha(sign) = Xhi(sign) 



(17) 



Xha(fract) = X h i(fract) 



(18) 



4 x X ha (exp) - 256 - X hi (exp) - 32,768 



(19) 



and 



X h i(exp) = 4x X ha (exp) + 2 (m - 1) - 2 (n+1) 



(20) 



It should be noted that the expression {2 15 - 2 X }, where X<15 is equal to 2 14 + 2 13 
+ 2 12 + 2 11 + ... + 2 X . When this value is represented as a 16 bit binary number, the most 
significant bit is a 0 since X is a non-negative integer. Subsequent bits of lower 
significance are 1 's up to and including bit number X + 1 . The remaining bits of lower 
significance are zeros. For example, the expression {2 15 - 2 7 } is equivalent to the binary 
number Oil 11 11 110000000. 

As seen above, the format conversion process requires that the value of the 
external exponent be added to the adjustment term which is in the form {2 15 -2 X }. If we 
perform this addition in binary, we note that this addition has an interesting property. 
The most significant bit of the external exponent is of the value 2 X because the external 
exponent is X + 1 bits in length. If the most significant bit of the external exponent is a 1, 
then there is a carry out of that bit position and ripples all the way to the most significant 
bit of the result. The binary result will have the most significant bit on if the most 
significant bit of the external input is on. The subsequent 15 - X bits will be active if the 
most significant bit of the external exponent is not on. The remaining X bits will simply 
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be the least significant X bits of the external exponent. Therefore, the exponent 
conversion from binary external format to binary internal format is given by the 
following expression. 



Internal Exponent(O) = Architected Exponent(O) 



(21) 



Internal Exponent(l :15 - (n -1)) = (NOT Architected Exponent(O)) n (NOT 
Architected(O)) 11 ... II (NOT Architected Exponent(O)) 



(22) 



Internal Exponent(15 - (n -1):15) = Architected Exponent(l:n) 



(23) 



where n is the length of the architected exponent format. 

As noted in the conversion equations for the architected hexadecimal exponent 
above, it can be seen that the exponent format conversion from hexadecimal architected 
to internal is also the same process. However, since the external hexadecimal exponent is 
a base 16 exponent, it needs to be multiplied by four or left shifted by 2 positions to 
convert it to a binary exponent. The exponent format conversion for hexadecimal is 
given by the following expression. 

Internal Exponent(O) = Architected Exponent(O) (24) 



Internal Exponent(l:7) = (NOT Architected Exponent(O)) n (NOT Architected(O)) 



Note that the sign bits and fractions do not need any conversions and are equivalent in 
either internal or architected format. 
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I! ... II (NOT Architected Exponent(O)) 



(25) 



Internal Exponent(8:15) = Architected Exponent(l:6) n '00' 



(26) 
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The format conversion from internal to external takes place in the final stage of 
the pipeline just before the result register. The format conversion process is simply the 
reverse of the process that was performed to convert to the internal format. The process 
is achieved by controlling the result register input multiplexing. The selection is defined 
as follows. 

Architected Binary Exponent(O) = Internal Exponent(O) (27) 
Architected Binary Exponent(l :n) = Internal Exponent(l 5 - (n -2): 1 5) (28) 

and 

Architected Binary Fraction(0:y-1) = Internal Fraction(l :y) (29) 

Also, 

Architected Hexadecimal Exponent(O) = Internal Exponent(O) (30) 
Architected Hexadecimal Exponent(l :6) = Internal Exponent(8: 13) (31) 

and 

Architected Hexadecimal Fraction(0:y - 1) = Internal Fraction(0:y - 1) (32) 

where y is the length of the architected fraction field. 

As also shown in FIG. 1, the FPU 10 receives floating point data from the IDR 16, 
and then implements input radix point correction ("IRPC") 24 followed by normalization 
by the pre-aligner ("PRA") 26. The arithmetic unit ("AU") 28 then performs the desired 
arithmetic manipulations of the data in internal binary format in a manner well known to 
those of ordinary skill in the pertinent art. The resulting data is received in internal 
binary format for output radix point correction ("ORPC") 30. Intermediate data is then 
received by the post-aligner ("POA") 32 for subsequent transfer from the FPU 10 to the 
OFCM 18. 
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It should be noted that the POA 32 is designed to normalize on hexadecimal digit 
boundaries when operating on hexadecimal data. There is no special requirement for the 
pre-normalization process of the PRA 26 since it is controlled by the internal exponent 
which is already on a hexadecimal boundary. Also, the exponent logic of the AU 28, 
5 designed to handle floating point addition alignment, is capable of handling two different 

biases and locations of implied radix points. This complication does not impact the cycle 
time and is only a slight hardware area increase since the exponent data-flow is much 
smaller than the fraction data-flow. 

An advantage of a Floating Point Unit of the invention is that the physical 
10 hardware requirements are reduced over units having separate data paths for multiple data 

architectures. 

Another advantage of a Floating Point Unit of the invention is that the time delay 
for conversion of data into the internal architecture from at least one of the supported 
external architectures is reduced over units that require conversion of the fractional 
1 5 portion of an external architecture. 

A further advantage of a Floating Point Unit of the invention is that the cost of the 
unit having a single data path is less than the cost would be for the additional hardware 
required to implement multiple data paths. 

Another advantage of a Floating Point Unit of the invention is that the space 
20 requirement of the unit having a single data path is less than that of the additional 
hardware required to implement multiple data paths. 

An additional advantage of a Floating Point Unit of the invention is that the power 
usage of the unit having a single data path is less than that of the additional hardware 
required to implement multiple data paths. 
25 A further advantage of a Floating Point Unit of the invention is that the cost of the 

unit that does not require conversion of the fractional portion of external architecture data 
into an internal architecture is less than the cost would be to produce hardware fast 
enough to compensate for the time delay required for conversion of the fractional portion 
of external architecture data into an internal architecture. 
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While preferred embodiments have been shown and described, various 
modifications and substitutions may be made thereto without departing from the spirit 
and scope of the invention. Accordingly, it is to be understood that the present invention 
has been described by way of illustration only, and such illustrations and embodiments as 
have been disclosed herein are not to be construed as limiting to the claims. 
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